Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
๐ฎ Reinforcement Learning
RLHF, Policy Gradient, Reward Models, Agent Training
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
22702
posts in
15.4
ms
Are
AIs
more likely to
pursue
on-episode or beyond-episode reward?
lesswrong.com
ยท
10h
๐พ
Agent Memory
The
OODA
Loop
Pattern
for Autonomous AI Agents โ How I Built a Self-Improving System
dev.to
ยท
1d
ยท
Discuss:
DEV
๐
Sovereign AI Infrastructure
Claude
Opus
4.6 Introduces Adaptive Reasoning and Context
Compaction
for Long-Running Agents
infoq.com
ยท
18h
๐ญ
Anthropic Claude
How A
Regular
Person Can
Utilize
AI Agents
weightythoughts.com
ยท
1d
โ๏ธ
Prompt Engineering
Beyond the
hype
: A real-world guide to building
enterprise-grade
AI agents
thoughtworks.com
ยท
1d
๐
Sovereign AI Infrastructure
From field
experiments
to policy
interventions
at scale
science.org
ยท
15h
๐
Complexity Economics
๐ฎ
Reinforcement
Learning
Explained
Like You're 5
sreekarreddy.com
ยท
4d
ยท
Discuss:
DEV
โ๏ธ
Prompt Engineering
Agents need
vector
search more than
RAG
ever did
venturebeat.com
ยท
7h
๐พ
Agent Memory
Escaping
the โDemo
Trap
โ: A Guide to Engineering Reliable AI Agents
dzone.com
ยท
13h
๐
AGENTS.md
Understanding AI Agents for Data Scientists: The
Basic
Loop
datascienceweekly.substack.com
ยท
1d
ยท
Discuss:
Substack
๐ค
AI Tools
roli-lpci/zer0dex
: Dual-layer memory for AI agents. Compressed index + vector store. 91% recall, 70ms, fully local.
github.com
ยท
13h
ยท
Discuss:
r/Python
๐พ
Agent Memory
From
Latency
to Streaming: Optimization Strategies for Multi-Agent Systems with Google
ADK
medium.com
ยท
1d
๐ง
Context Engineering
Introducing
the new AI agents
oodaloop.com
ยท
1d
๐
AGENTS.md
Teaching
AI to
Escape
: The Power of Deep Reinforcement Learning
dev.to
ยท
4d
ยท
Discuss:
DEV
๐
Sovereign AI Infrastructure
Investing
in Mind
Robotics
a16z.news
ยท
1d
๐ญ
Anthropic Claude
Power Steering: Behavior Steering via Layer-to-Layer
Jacobian
Singular
Vectors
lesswrong.com
ยท
24m
๐
Model Routing
Every minute you
aren
โt running 69 agents, you are
falling
behind
geohot.github.io
ยท
2d
ยท
Discuss:
Hacker News
๐
Adaptive Markets
Mitigating
The Risk of Prompt Injection for AI Agents on
Databricks
databricks.com
ยท
1d
๐
AGENTS.md
I built
MEO
: a runtime that lets AI agents learn from past
executions
(looking for feedback)
github.com
ยท
1d
ยท
Discuss:
r/Python
๐ง
Context Engineering
The
Mathematics
of Game Theory
geopoliticsreport.substack.com
ยท
3d
ยท
Discuss:
Substack
โ๏ธ
Game Theory
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help